Text-conditioned Transformer for automatic pronunciation error detection

نویسندگان

چکیده

Automatic pronunciation error detection (APED) plays an important role in the domain of language learning. As for previous ASR-based APED methods, decoded results need to be aligned with target text so that errors can found out. However, since decoding process and alignment are independent, prior knowledge about is not fully utilized. In this paper, we propose use as extra condition Transformer backbone handle task. The proposed method output states consideration relationship between input speech a end-to-end fashion. Meanwhile, used decoder input, works feed-forward manner instead autoregressive inference stage, which significantly boost speed actual deployment. We set baseline model conduct several experiments on L2-Arctic dataset. demonstrate our approach obtain 8.4% relative improvement F1 score metric.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing different approaches for automatic pronunciation error detection

One of the biggest challenges in designing computer assisted language learning (CALL) applications that provide automatic feedback on pronunciation errors consists in reliably detecting the pronunciation errors at such a detailed level that the information provided can be useful to learners. In our research we investigate pronunciation errors frequently made by foreigners learning Dutch as a se...

متن کامل

Automatic pronunciation error detection and guidance for foreign language learning

We propose an e ective application of speech recognition to foreign language pronunciation learning. The objective of our system is to detect pronunciation errors and provide diagnostic feedback through speech processing and recognition methods. Automatic pronunciation error detection is used for two kinds of mispronunciation, that is mistake and linguistical inheritance. The correlation betwee...

متن کامل

Automatic Pronunciation Error Detection Based on Extended Pronunciation Space Using the Unsupervised Clustering of Pronunciation Errors

Calculating posterior probability within a standard pronunciation space (SPS) is a common method in automatic pronunciation error detection (APED). However, to pronunciation errors outside the SPS, this kind of methods can only give an approximate solution, that may be not right in many applications. This paper expands the SPS to include more pronunciation errors, introduces a Bhattacharyya dis...

متن کامل

Automatic pronunciation error detection: an acoustic-phonetic approach

In this paper, we present an acoustic-phonetic approach to automatic pronunciation error detection. Classifiers using techniques such as Linear Discriminant Analysis or a decision tree were developed for three sounds that are frequently pronounced incorrectly by L2-learners of Dutch: /A/, /Y/ and /x/. The acoustic properties of these pronunciation errors were examined so as to define a number o...

متن کامل

Automatic Error Recovery for Pronunciation Dictionaries

In this paper, we present our latest investigations on pronunciation modeling and its impact on ASR. We propose completely automatic methods to detect, remove, and substitute inconsistent or flawed entries in pronunciation dictionaries. The experiments were conducted on different tasks, namely (1) word-pronunciation pairs from the Czech, English, French, German, Polish, and Spanish Wiktionary [...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Speech Communication

سال: 2021

ISSN: ['1872-7182', '0167-6393']

DOI: https://doi.org/10.1016/j.specom.2021.04.004